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Short Measures 
of Personality Traits 


In recent years, there has been a growing interest in short versions of ques- 
tionnaires for measuring personality (Baldasaro, Shanahan, & Bauer, 2013; 
McCrae & Costa, 2007; Thalmayer, Saucier, & Eigenhuis, 2011). Several short- 
ened measures of personality published in English and German, showing satis- 
factory psychometric properties, have already gained popularity among research- 
ers. These are, for example: the Questionnaire Big Six Scale (24QB6; Thalmayer 
et al., 2011), being a 24-item questionnaire measuring six personality traits, as 
well as 10-item questionnaires measuring five personality traits: the 10-Item Big 
Five Inventory (BFI-10; Rammstedt, 2007); Ten-Item Personality Inventory 
(TIPI; Gosling, Rentfrow, & Swann, 2003); Mini-Markers (Saucier, 1994), being 
a 40-adjective version of Goldberg’s (1992) list of the best lexical markers of the 
Big Five, originally comprising 100 items (Big Five Factor Markers); or the 20- 
item Mini-IPIP inventory, being a short version of the 50-item questionnaire for 
measuring the Big Five personality traits (Donnellan, Oswald, Baird, & Lucas, 
2006) from the resources of the /nternational Personality Item Pool (IPIP). 

The advantages of shortened versions of questionnaires include low cost and 
the short time that it takes to fill them in (Herzberg & Brahler, 2006). This makes 
it possible to include personality measurement in studies whose time is limited, 
such as those conducted online (Gosling et al., 2003), or in studies with repeated 
measurement (e.g., longitudinal studies). Shortened versions are much less tire- 
some for respondents and therefore reduce the risk of errors resulting from acci- 
dental indication of answers (Thalmayer et al., 2011). Their application is also 
a good solution in the case of respondents who may have difficulties in reading 
(McCrae & Costa, 2007). 

Reise and Henson (2000) demonstrated that most of the variability in traits 
measured by the NEO-PI-R questionnaire on 8-item scales can be measured at 
a comparable quality level using 4-item scales. They used the CAT procedure 
(computerized adaptive testing). In this procedure, in order to maximize the pre- 
cision of measurement, the computer systematically selects test items for each 
particular person based on information following from the person’s earlier res- 
ponses (Weiss, 2004). 

Nevertheless, the construction and use of short versions of questionnaires in- 
volves certain risks. Reducing the number of items may result in lower reliability 
of scales (McCrae & Costa, 2007) or increase the risk of error in conclusions 
concerning the relationships between personality traits and other constructs 
measured. The items selected for the shortened version of the questionnaire, 
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measuring personality traits, usually represent them to a limited extent, which 
may make it difficult to demonstrate the relationships between the traits and oth- 
er variables (Credé, Harms, Niehorster, & Gaye-Valentine, 2012). For example, 
in the model proposed by McCrae and Costa (2007) each of the five basic perso- 
nality traits consists of six facets, and therefore a scale consisting of only three 
items cannot fully convey the specificity of a given trait. Using a scale thus 
shortened for explaining the variance of some other variable may lead to an un- 
derestimation of the prognostic validity of the personality trait that the scale 
measures. What is more, if some other construct is used for explaining part of the 
remaining variance of that variable, the predictive value of that construct may be 
overestimated. This would be a result of underestimating the initial variance ex- 
plained by the personality trait measured using the short scale and of underesti- 
mating the correlation of that trait with this construct (Credé et al., 2012). 

However, it is possible to reduce the above risks. Careful selection of items 
for short versions of scales makes it possible to achieve satisfactory reliability 
(Fischbach & Moosbrugger, 2007). Numerous studies have also confirmed that 
the use of an appropriate method of constructing short versions of questionnaires 
can result in the short versions having only slightly lower criterion validity than 
their full-length prototypes (e.g., Credé et al., 2012; Frazier, Naugle, & Haggerty, 
2012; Thalmayer et al., 2011). A good solution is to shorten scales to four items. 
McCrae and Costa (2007) showed that the mean from this number of items re- 
duces random error and systematic error and that using a balanced key, with half 
of the items reverse-coded, enables controlling the tendency to agree. 


Selected Ways 
of Shortening Questionnaires 


Various authors have used different techniques to select the optimal items 
from the basic version of a measure that would make up its shortened version. 
A few typical approaches found in the literature will be briefly described below. 

When creating the NEO-PI-3 shortened questionnaire for investigating per- 
sonality traits, McCrae and Costa (2007) used regression analysis. The dependent 
variable was the score on an 8-item scale and the independent variables were the 
items. The criterion of selecting items for the shortened version was the amount 
of variance explained by a given item in the regression analysis. 

Batinic, Wolff, and Haupt (2007) used a different method when constructing 
a shortened version of the Trendsetting Questionnaire (TDS-K). The procedure 
had two stages. The first one was selecting those items from each scale whose 
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correlation with the overall score on that scale was higher than .5. In the second 
stage, the authors selected items with the highest face validity. 

Another popular approach is based on factor analysis. It consists in selecting 
those items for the shortened version that have high factor loadings on the re- 
spective factors and low cross-loadings (Samson & Huber, 2010). 

When creating shortened versions of questionnaires, some authors combine 
several criteria. For example, when shortening Eysenck’s Personality Inventory, 
Fischbach and Moosbrugger (2007) sought to meet the criteria connected with 
both the orthogonality of factors and the reliability of scales. 

Still another procedure was applied by Donnellan and colleagues (2006) in 
creating the 20-item version of the originally 50-item questionnaire measuring 
five personality traits. This procedure will be described in greater detail because 
we used it in our research. 


The Procedure of Shortening the Questionnaire 
Proposed by Donnellan and Colleagues 


The process through which Donnellan et al. (2006) selected the items to be 
included in the shortened version comprised several stages: (1) By means of 
exploratory factor analysis (EFA) with Varimax rotation, factor loadings were 
computed for each of the 50 items; also computed were their cross-loadings on 
the remaining factors. (2) Next, the mean of absolute values of all the cross- 
loadings was computed for each item. (3) The obtained mean was subtracted 
from the absolute value of factor loading of a given item on the respective factor. 
The discrimination score was thus obtained, being an indicator of the extent to 
which a given item was a good measure of a particular trait independently of 
the influence of other factors. This procedure was meant to guarantee the con- 
struction of scales that were both internally consistent and independent of one 
another. (4) Finally, two positively keyed items with the highest discrimination 
scores were selected for each scale, and two negatively keyed items with the 
highest discrimination scores were selected likewise, in accordance with the re- 
commendations of Saucier and Goldberg (2002), so that the scales were balanced 
with regard to the number of positively and negatively keyed items. 

Following the above procedure, the authors computed the discrimination 
score for each item. Item selection on its basis was impossible in a few cases 
because of the criterion described in point 4, according to which every shortened 
scale was to consist of two positively keyed and two negatively keyed items. The 
problem was that the Emotional Stability scale in the questionnaire being short- 
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ened had only two positively keyed items and the Intellect scale had only three 
negatively keyed ones (Donnellan et al., 2006). 

Donnellan and colleagues (2006) verified the selected items in EFA proce- 
dure on a different sample. The hypothesized model was in principle confirmed, 
but a few items were found not to have acceptable factor loadings. For this rea- 
son, they were replaced with items that took the next places in the first EFA in 
terms of discrimination score (in the case of the Conscientiousness scale) or with 
items that better conveyed the theoretical meaning of the construct (in the case of 
the Intellect scale; Donnellan et al., 2006). 

Thus, the procedure proposed by Donnellan et al. (2006) was not applied 
mechanically. It rather constituted a set of guidelines concerning item selection, 
but it was slightly modified where appropriate, if the modifications increased the 
chance of selecting such items that would make up a better-quality measure. 


The 20-Item Questionnaire 
for Measuring the Big Five 


The aim of the presented research was to develop a short (20-item) version 
of the Polish adaptation of the IPIP-BFM-50 questionnaire for measuring five 
personality traits (Strus, Cieciuch, & Rowinski, 2014b) using the procedure pro- 
posed by Donnellan and colleagues (2006). 

The 50-item questionnaire, from IPIP resources, was named IPIP-FFM by 
Donnellan et al (2006), where FFM stands for Five-Factor Model. It is worth 
noting, however, that this is a questionnaire measuring the five basic personality 
traits as identified in the lexical approach (Goldberg, 1990, 1992). The differenc- 
es between the lexical Big Five and the Five Factor Model derived from the psy- 
chometric approach are not fundamental and the two terms are often used inter- 
changeably (John & Srivastava, 1999; De Raad & Perugini, 2002). Yet, from the 
point of view of IPIP resources, the difference is important, since those resources 
include questionnaires for measuring both the lexical Big Five and the traits from 
the Five-Factor Model of Personality (Saucier & Goldberg, 2002). The 50-item 
questionnaire, whose shortened version was developed by Donnellan and col- 
leagues (2006), is the questionnaire equivalent of Big Five Factor Markers 
(BFM), Goldberg’s (1992) list of 100 adjectives that turned out to be the best 
lexical markers of the Big Five in the English language. Because IPIP resources 
also include questionnaires for measuring five personality traits in the psychome- 
tric tradition (Five-Factor Model, FFM), we precisely distinguish between IPIP- 
BFM (the lexical approach) and IPIP-FFM (or IPIP-NEO, the questionnaire ap- 
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proach) measures. In the light of the above, the name used by Donnellan and 
colleagues is not accurate. For this reason, we will be abbreviating the question- 
naire’s name to IPIP-BFM-50 (cf. Strus et al., 2014b). Donnellan and colleagues 
(2006) refer to the 20-item version of the measure as Mini-IPIP. We will be refer- 
ring to our 20-item version as the IPIP-BFM-20. 

The use of the IPIP-BFM-50 in analyses is justified by several reasons. 
Firstly, the questionnaire serves to measure personality in the commonly used 
Big Five Model. Secondly, it is a Polish adaptation of the IPIP-BFM-50 ques- 
tionnaire, which was also the point of departure for the shortened version devel- 
oped by Donnellan and colleagues (2006). In this respect, our study is a replica- 
tion of the research carried out by Donnellan et al. (2006). Previous analyses 
show that Donnellan’s Mini-IPIP is nearly equal in psychometric terms to its 
50-item prototype (Credé et al., 2012; Donnellan et al., 2006; Thalmayer et al., 
2011). Thirdly, the IPIP-BFM-50 is available free of charge from the public do- 
main at www.ipip.ori.org (Goldberg et al., 2006), whose Polish version — 
www. ipip.edu.pl — has also been developed (Strus, Cieciuch, & Rowiński, 2011, 
2013). Fourthly, the Polish adaptation of the IPIP-BFM-50 is available as 
adapted by Strus et al. (2014b). Taking into account Thalmayer at al.’s (2011) 
argumentation, we decided to repeat the procedure used by Donnellan et al. 
(2006) rather than adopt their short version. The procedure of item selection ap- 
plied by Donnellan et al. (2006) may bring somewhat different results with dif- 
ferent data. This is of particular importance when data has been collected in 
a different country and by means of a different language version, as in our case. 
In Poland, the IPIP-BFM-50 questionnaire is an adaptation of that measure from 
the English language. It can therefore be expected that some items may have 
different properties in the analyses carried out on data from the Polish sample. 


The Present Study 


We applied Donnellan’s procedure on Polish data collected using IPIP-BFM- 
50 (Strus et al., 2014b). However, we made a significant modification to the pro- 
cedure by adding another stage, verifying the quality of measurement. We veri- 
fied the effects of item selection in confirmatory factor analysis (CFA), carried 
out on a different sample. During item selection we also used Cronbach’s alpha 
as an additional criterion. 

Moreover, we verified the validity of the measure by analyzing its relation- 
ships with four other questionnaires for measuring the Big Five. 
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METHOD 


Participants 


The study was carried out using the paper-and-pencil method. The partici- 
pants were 903 people aged from 16 to 83 years (Mage = 30.97; SDage = 13.82); 
55% of them were women. The participants lived in various regions of Poland, 
but the vast majority of them (73.1%) came from central Poland. A considerable 
percentage of the participants (43.3%) lived in cities with over 500,000 inhabi- 
tants, 9.7% lived in towns with a population between 100,000 and 500,000 
people, and 29.7% lived in towns with less than 100,000 inhabitants; 17.2% of 
participants lived in the countryside. About 31% of participants had higher edu- 
cation and just as many were still university students. Incomplete higher educa- 
tion was declared by 6.5% of the participants, post-secondary education — by 
2.5%, secondary — by 21%, vocational — by 3.8%, and elementary — by 3.6%. 
The research was conducted by trained students; each of them examined a few 
people. Participation was voluntary and anonymous. 


Measure 


The IPIP-BFM-50 


In our study we used items included in the IPIP-BFM-50 (Goldberg, 1999; 
Goldberg et al., 2006) as adapted into Polish by Strus et al. (2014b), based on 
which we constructed the IPIP-BFM-20. 

The IPIP-BFM-50 questionnaire comprises 50 items, 10 per each scale. Par- 
ticipants indicated their answers on a 5-point scale from 1 (very inaccurate) to 5 
(very accurate). Both the IPIP-BFM-50 and the IPIP-BFM-20 measure five per- 
sonality traits in the lexical tradition. Table 1 presents their brief descriptions as 
proposed by Strus et al. (2014b). 
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Table 1 


Description of the Five IPIP-BFM-50 Scales (Strus et al., 2014b) 


Individuals who score high 


Individuals who score low 


Scale Object of measurement 5 3 
J may be described as: may be described as: 
The level of activity, active, energetic, extra- introverted, reserved, quiet, 
: energy, as well as sociabili- verted, talkative, bold, and and socially inhibited. 
Extraversion ; hi 
ty and social confidence assertive. 
(assertiveness). 
Positive (vs. negative) trustful, kind, considerate distrustful, selfish, unkind, 
Agreeableness attitude towards people. and warm as well as coop- rude, and emotionally cold 
erative and helpful. towards other people. 
The level of organization, organized, diligent, tho- unsystematic and inconsistent, 
diligence in pursuing goals rough and efficient in what unconcerned with order and 
Conscientiousness and performing tasks as they do as well as systemat- planning, negligent, careless, 
well as proneness to order ic and dutiful. and undependable. 
and dutifulness. 
The level of reactivity imperturbable, calm, anxious, nervous, moody, 
Emotional and emotional stability, relaxed, not prone to nega- prone to worry and oversensi- 
Stability emotional resistance and tive emotional states. tive as well as envious, touchy, 
tolerance to frustration. prone to anger and irritation. 
Intellectual openness, intellectually active and unintellectual, noninquisitive, 
creativity, and imagination. cognitively open, creative, unimaginative, simple, unso- 
Intellect introspective, having phisticated, unreflective and 


a vivid imagination and 
a wide range of interests. 


uncreative. 


Other questionnaires measuring 
the Big Five traits 


Personality traits were also measured using four other questionnaires. These 
were: (1) the NEO-PI-R questionnaire by Costa and McCrae (1992) adapted into 
Polish by Siuta (2006); (2) the IPIP-NEO-PI-R questionnaire, measuring the 
same personality traits as the NEO-PI-R but derived from IPIP resources. The 
Polish version of the IPIP-NEO-PI-R was prepared by Rowinski, Strus, Cieciuch, 
and Wieman. (3) Big Five Aspects Scales (BFAS; DeYoung, Quilty, & Peterson, 
2007), measuring five personality traits and their ten aspects. The Polish version 
of the questionnaire was prepared by Strus, Cieciuch, and Rowinski (2012). 
(4) The IPIP-45ABSC questionnaire, measuring 45 variables in the AB5C model 
(Abridged Big Five Dimensional Circumplex) developed by Hofstee, De Raad, 
and Goldberg (1992). This measure was adapted into Polish by Strus, Cie- 
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ciuch, and Rowinski (2014a). Measurements using the above questionnaires were 
carried out between two and six weeks after the measurement using the 
IPIP-BFM-50. 


Analysis Plans 


We carried out statistical analyses on two study groups, obtained in the cross- 
-validation procedure, in which we divided the group of N = 903 participants 
randomly into two subgroups (Browne, 2000). We performed exploratory statis- 
tical analysis on one of the subgroups and confirmatory analysis on the second 
one. 

Based on the results of EFA performed on the first group (n = 467, Mage = 
= 31.58, SDage = 14.40, women = 57%), we selected items to be included in the 
short version of the questionnaire (following the procedure described above) and 
computed the values of Cronbach’s alpha for the short scales made up of the 
selected items. In the second group (n = 436, Mage = 30.26, SDage = 13.17, women 
= 54%), we checked the psychometric properties of the 20-item questionnaire by 
performing CFA and reliability analysis. 

Additionally, we performed a correlation analysis of many methods, entering 
the results of Big Five measurement using the IPIP-BFM-50, the IPIP-BFM-20, 
and four other questionnaires. 


RESULTS 


Exploratory Analyses 


Analyses aimed at selecting items for the shortened version of the IPIP- 
BFM-50 questionnaire were performed on the first group (n = 467). We per- 
formed all the analyses presented below using SPSS 20. EFA results for the 50 
items of IPIP-BFM-50 can be found in Table 2. 
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Table 2 

Factor Loadings in EFA With Forced Five Factors After Varimax Rotation, the Mean Absolute 
Value of Cross-Loadings, and Discrimination Scores (the Difference Between the Absolute Value 
of the Factor Loading of a Given Item on the Appropriate Factor and the Mean Absolute Value of 
That Items all Cross-Loadings) in Group 1 


Num- Factor Mean Discri- 
ber Item content cross- eee 
item 1 2 3 4 5 loading KEA 
39n* Have frequent mood swings .73 -01 .03 A -.01 105 .68 
34n Change my mood a lot -70 06 -.06 12 -.01 .06 .64 
14n* Worry about things 70-12 08 -04 .00 .06 62 
49n Often feel blue 66 -.21 -.10 .13 -.01 ll .55 
19n* Seldom feel blue -.63 19 .05 -.05 01 08 56 
24n Ameasily disturbed 60 -11 08 — -.03 -.10 08 52 
44n Get irritated easily 58 .03 -.12 3 .06 .09 .49 
29n Get upset easily .57 .04 -13 14 -.02 .09 .48 
4n Get stressed out easily .56 -19 .12 -01 -.03 .09 .48 
9n* Am relaxed most of the time -.43 .23 .08 .06 .20 14 28 
45i Spend time reflecting on things .38 -.05 23 .06 32 18 14 
46e* Am quiet around strangers .09 -.77 .00 .09 -.06 .06 el 
le* Am the life of the party -.03 72 .04 .05 21 08 .64 
l6e* Keep in the background 20 -70 -13 07 -.09 512 57 
lle Feel comfortable around people -.20 -66 26 .01 .07 13 52 
3le* Talk to a lot of different people at parties -.01 63 .02 -.03 19 .06 nri 
6e Don’t talk a lot 05 -63 -18 -01 -.09 .08 39 
2le Start conversations -.09 62 .19 -.03 .20 .13 49 
36e Don’t like to draw attention to myself 04 -54 04-13 -.20 10 44 
4le Don’t mind being the center of attention -.17 48 09 04 28 14 34 
2u* Feel little concern for others .03 -.11 -.62 .02 -.03 105 57 
22a* Am not interested in other people’s problems -.01 -.09 -59 09 -.04 .06 53 
32a Am not really interested in others 13-14 -58 -01 -.15 aol AT 
17a* Sympathize with others’ feelings -.06 -04 .57 -.07 -.05 .05 52 
Ta Am interested in people .08 22, 57 .02 AS 12 45 
42a Feel others’ emotions .22 .00 .57 -.03 21 12 45 
37a* Take time out for others .03 09 52 .01 .09 .05 AT 
27a Have a soft heart -.05 105 .42 -14 .08 .08 .34 
12a Insult people :26 .17 -.40 .24 13 .20 .20 
47a Make people feel at ease -.06 .20 31 03 AS J1 .20 
23c* Get chores done right away -.01 .06 -02 -66 -.07 .04 62 
8c* Leave my belongings around 04 08 -10 65 21 zi 54 
28c* Often forget to put things back in their proper place 07 .03 .00 .6l AS 06 54 
43c* Follow a schedule -.03 .00 .04 -.55 .06 .03 51 


33c Like order .07 -.05 17 -.54 -.05 08 45 
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Num- Factor Mean paa 
ber Item content cross- š 
item 1 2 3 4 5 loading oa 
38c Shirk my duties 12 04 -05 53 -.11 .08 .45 
18c Make a mess of things 26 .00 .01 52 -.05 08 44 
48c Am exacting in my work -.06 07 .07 -.47 14 .08 39 
3c Am always prepared -.09 .03 -06 -45 .10 .07 .38 
13c Pay attention to details 13 02 09 = -.29 26 13 AT 
30i* Do not have a good imagination -.09  -.13 -30 -0l -.63 13 50 
50i* Am full of ideas -.07 27 13 .01 .61 A2 49 
25i Have excellent ideas -.18 .24 10 .02 55 14 Al 
15i Have a vivid imagination 22 10 22 14 53 AT 36 
T Have a rich vocabulary -.05 3) -.01 -.14 48 09 39 
40i Use difficult words -.04 13 -.07 .02 .44 .06 37 
20i Am not interested in abstract ideas -08 -04 -06 -09 -.40 .07 .33 
35i Am quick to understand things -.17 .09 08 -14 .38 2 26 
26e Have little to say .01 -.26 -07 .03 -.33 11 22 
10i* Have difficulty understanding abstract ideas .10 -10 -05 A7 -.32 .10 .21 


Note. n — Emotional Stability items; e — Extraversion items; a — Agreeableness items; c — Conscientiousness 
items; i — Intellect items. * Items included in the final version of the IPIP-BFM-20. 


According to the results obtained by applying Donnellan’s procedure, the 
shortened version of the IPIP-BFM-50 should include the following items: 1, 16, 
31, 46 (Extraversion); 2, 17, 22, 37 (Agreeableness); 8, 23, 28, 43 (Conscien- 
tiousness); 9, 19, 34, 39 (Emotional Stability); and 20, 25, 30, 50 (Intellect). 
However, after such selection very similar items are found within a factor. This is 
the case for the Emotional Stability and Intellect scales. In the case of Emotional 
Stability, the similar items are 34 (“Change my mood a lot”) and 39 (“Have fre- 
quent mood swings”). As can be seen in Table 2, the discrimination score of Item 
14 (“Worry about things”) is lower than that of Item 34 by only 0.02. An analog- 
ous situation occurred in the case of the Intellect scale, the similar items being 50 
(“Am full of ideas”) and 25 (“Have excellent ideas”). As Table 2 shows, Item 5 
(“I have a rich vocabulary”) has a lower discrimination score that Item 25 by 
only 0.02. In order to semantically differentiate the items making up the short- 
ened version of the IPIP-BFM-50, in the final version of this measure we re- 
placed Item 34 with Item 14 in the Emotional Stability scale and Item 25 with 
Item 5 in the Intellect scale. 

For scales thus constructed, we computed Cronbach’s alpha coefficients. The 
items we entered in the analysis were those obtained after applying Donnellan’s 
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procedure and after replacing two items for semantic reasons: 34 with 14 and 25 
with 5. Cronbach’s alpha coefficient for the 4-item Intellect scale was found to be 
too low (a = .57). According to Donnellan’s procedure (Donnellan et al., 2006), in 
the shortened version of the questionnaire there should be two items measuring 
a given trait directly and two reverse-scored ones. That particular scale in the 
IPIP-BFM-50 includes three negatively keyed items: 30 (“Do not have a good 
imagination”), 10 („Have difficulty understanding abstract ideas”), and 20 (,,Am 
not interested in abstract ideas”). In order to obtain an acceptable value of Cron- 
bach’s alpha reliability coefficient for the Intellect scale and at the same time im- 
plement the recommendation to include two negatively keyed items in the short- 
ened version of the measure, we replaced one of the negatively keyed Intellect 
items. Of the two items that negatively load the Intellect scale, selected in accor- 
dance with Donnellan’s procedure, Item 20 has a lower discrimination score (.33) 
than Item 30 (.50). Based on the discrimination scores, we replaced Item 20 with 
Item 10. Thus constructed, the Intellect scale has a reliability of a = .61. 

Eventually, the IPIP-BFM-20 comprises the following items: 1, 16, 31, 46 
(Extraversion); 2, 17, 22, 37 (Agreeableness); 8, 23, 28, 43 (Conscientiousness); 
9, 14, 19, 39 (Emotional Stability); and 5, 10, 30, 50 (Intellect). 


Confirmatory Analyses 


The items selected for the short version of the questionnaire were entered in 
CFA, performed in AMOS 20. The assessment of the model’s goodness-of-fit to 
data was based on RMSEA, CFI, and SRMR indices. The threshold values of 
model acceptability were RMSEA and SRMR values below .08 and CFI value 
above .9 (Hu & Bentler, 1999; Marsh, Hau, & Wen, 2004). 

The following statistical fit indices were obtained for the proposed model 
(Figure 1): CFI = .899, RMSEA = .05, SRMR = .06. However, according to the 
modification indices, it is possible to correlate one pair of errors in the model: 
Item 23 error with Item 43 error (both items being part of the Conscientiousness 
scale). After the correlation of these errors, the model has the following good- 
ness-of-fit to data: CFI = .911, RMSEA = .05, SRMR = .06. 
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item 1 
item 46 
item 31 Extraversion 
item 16 
item2 
item 37 
item 22 Agreeableness 
item 17 
item 23 
item 28 
Conscien- 
item 8 tiousness 
item 43 
item 14 
item 39 
Emotional 
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item 10 
item 50 
item 5 
item 30 


Figure 1. The five-factor model of the short questionnaire for measuring the Big Five in Group 2 
(n = 436). 


We checked the reliability of the final versions of the questionnaire’s short 
scales using Cronbach’s alpha. The values of that coefficient reached the accepta- 
ble level for all the scales. The results obtained are presented in Table 3. 
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Table 3 
Cronbach s a Coefficients in Each Group 
IPIP-BFM-50 IPIP-BFM-20 
(50 items) (20 items) 
The entire sample Group 1 Group 2 
n = 903 n= 467 n = 436 
Extraversion .86 .82 .78 
Agreeableness .79 69 il 
Conscientiousness .76 72 73 
Emotional Stability .86 3 .70 
Intellect .80 .61 .65 


The IPIP-BFM-20 and Other Measures 
of Five Personality Traits 


In order to check the validity of IPIP-BFM-20 scales, the scores obtained us- 
ing this questionnaire were correlated with those obtained using other question- 
naires for measuring five personality traits. Table 4 presents the correlation ma- 
trix of five personality traits measured using both the IPIP-BFM-20 and the IPIP- 
BFM-50 with four other personality measures. 

Analyses of correlations confirmed the validity of the IPIP-BFM-20 ques- 
tionnaire as regards the relations between its scores and those of other scales 
measuring the same (or very similar) traits. As Table 4 shows, the values of the 
correlation coefficient between the scales of IPIP-BFM-20 and the corresponding 
scales of other measures of five personality traits are higher than correlations 
with scales measuring other traits. The highest correlation coefficients were ob- 
tained between scales measuring Neuroticism or its opposite, Emotional Stability 
(.67 or higher). Coefficients of correlation between the corresponding scales of 
Extraversion, Conscientiousness, and Intellect (or Openness to Experience) were 
equal to or higher than .54. Correlations were the weakest between scales mea- 
suring Agreeableness, their coefficient values ranging from .37 to .74. A similar 
result was explained by Strus et al. (2014b) as an effect of differences in the de- 
finition of traits between the lexical and questionnaire approaches. Warmth, be- 
ing a component of Agreeableness in the lexical tradition, falls into the scope of 
Extraversion in the questionnaire tradition. The definition of Agreeableness in 
the questionnaire approach encompasses more aspects of morality and humility 
than the corresponding definition in the lexical approach (cf. Ashton & Lee, 
2005). 
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Table 4 

Correlations (Pearsons r) Between IPIP-BFM-20 and IPIP-BFM-50 Scales and NEO-PI-R 
(n = 883), IPIP-NEO-PI-R (n = 368), BFAS (n = 297), and IPIP-45AB5C Scales (N = 903) as Well 
as Intercorrelations Among IPIP-BFM-20 Scales (N = 903) 


Emotional ‘ Conscientio- 
ie Extraversion Agreeableness Intellect 
Stability usness 


IPIP- IPIP- IPIP- IPIP- IPIP- IPIP- IPIP- IPIP- IPIP-  IPIP- 


BFM- BFM- BFM- BFM- BFM- BFM- BFM- BFM- BFM- BFM- 
20 50 20 50 20 50 20 50 20 50 


Emotional 


Stability 1 90 24" 26" 08" 09" 04 09" 14" 06 
a Extraversion 24 2 92" -.02 .02 15 18" 32 297 
5 a os’ 10" -02 -05 1 oP aa ae ar u” 
2 Agreeableness .04 07 A5 2r 11” 18% 1 88" 0.25% 24" 
Intellect 14° 16" 32" 41” -01 i 25" 307 j 88” 
Neuroticism -70° aTa 25 aar" id a8" a03 072 
x Extraversion 23" 20" 0.59 65% -.12™ -.04 26* 31" 39” 39% 
3 raed 16” 19" 00 03 55° 67" 4 8 13" 08° 
Z Agreeableness — -.03 00-13-16" 142" 37" 46-09" -13% 
Openness -06 — -.04 22" 30" -.13" -03 30° 36% (54 64" 
Neuroticism -67 -68%  -27" -29% -11% 16" -03 -06 -18% -10 
í Extraversion QT 23" 55 638 -16% -16% 21 25 B32" 
z bad i at 02 0l 56° 67" 18" 20" 15" 04 
3 Agreeableness -.09 -04 -08 -09 23" 26% AT” 56” 03 -04 
7 Openness -09 -07 23" 31% -09 01 30° 33" 58 65" 
Neuroticism -69" = -.76" -20 -21% -10 -147 -07 -11 -17% — -.09 
Extraversion 32" 32" 61" 67-06 07 28" 31 4a 4 
£ Conscientio- 16" 17% 02 Ol 56% 62 10 16° -02 -09 
m usness 
Agreeableness -.04 02 -.13* -09 .13* 17% 54" 63% 09 .05 
Openness .10 13° .18™ .27* 06 16% 28" 37 57" 61” 
monona 17° 87" 07 06 is 19" 0510" 10% -02 
T Stability 
® Extraversion 33° 28" 80" 890 08" 002228 4a A3” 
$ posal AT” 21" 02,0376" 88 13 1713" 03 
a usness 
= Agreeableness -.05  -.01 .05 os 20 25" 74 ag a as” 
Intellect .03 .05 24 33" 01 A931" 3T TT 87” 


Note. * p < .05; ** p < 01. 


400 E. TOPOLEWSKA, E. SKIMINA, W. STRUS, J. CIECIUCH, T. ROWINSKI 


Moreover, the differences between correlation coefficients of corresponding 
IPIP-BFM-20 and IPIP-BFM-50 scales were found not to be higher than .12. 
Given that IPIP-BFM-50 scales consist of 10 items while in the shortened ver- 
sion there are four items, the decrease of correlation coefficients by .12 may be 
regarded as small. 


Analyses based on the procedure proposed by Donnellan and colleagues 
(2006) made it possible to construct the Polish 20-item IPIP-BFM-20 question- 
naire, being a shortened version of the IPIP-BFM-50, which serves to measure 
the Big Five in Goldberg’s lexical model. The IPIP-BFM-20 questionnaire is 
characterized by good validity (satisfactory results in CFA fit indices and correla- 
tion coefficients with other Big Five measures) as well as reliability (acceptable 
Cronbach’s a values). 

The results of the analyses carried out so far suggest that the Polish version 
of the IPIP-BFM-20 is at least as good an instrument for measuring the Big Five 
as its English-language counterpart. It may therefore be useful in all kinds of 
research where reducing the number of items in the personality questionnaires is 
advisable. 
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